495 research outputs found

    DonnĂ©es confidentielles : gĂ©nĂ©ration de jeux de donnĂ©es synthĂ©tisĂ©s par forĂȘts alĂ©atoires pour des variables catĂ©goriques

    Get PDF
    La confidentialitĂ© des donnĂ©es est devenue primordiale en statistique. Une mĂ©thode souvent utilisĂ©e pour diminuer le risque de rĂ©identification est la gĂ©nĂ©ration de jeux de donnĂ©es partiellement synthĂ©tiques. On explique le concept de jeux de donnĂ©es synthĂ©tiques, et on dĂ©crit une mĂ©thode basĂ©e sur les forĂȘts alĂ©atoires pour traiter les variables catĂ©goriques. On s’intĂ©resse Ă  la formule qui permet de faire de l’infĂ©rence avec plusieurs jeux synthĂ©tiques. On montre que l’ordre des variables Ă  synthĂ©tiser a un impact sur l’estimation de la variance des estimateurs. On propose une variante de l’algorithme inspirĂ©e du concept de confidentialitĂ© diffĂ©rentielle. On montre que dans ce cas, on ne peut estimer adĂ©quatement ni un coefficient de rĂ©gression, ni sa variance. On montre l’impact de l’utilisation de jeux synthĂ©tiques sur des modĂšles d’équations structurelles. On conclut que les jeux synthĂ©tiques ne changent pratiquement pas les coefficients entre les variables latentes et les variables mesurĂ©es.Confidential data are very common in statistics nowadays. One way to treat them is to create partially synthetic datasets for data sharing. We will present an algorithm based on random forest to generate such datasets for categorical variables. We are interested by the formula used to make inference from multiple synthetic dataset. We show that the order of the synthesis has an impact on the estimation of the variance with the formula. We propose a variant of the algorithm inspired by differential privacy, and show that we are then not able to estimate a regression coefficient nor its variance. We show the impact of synthetic datasets on structural equations modeling. One conclusion is that the synthetic dataset does not really affect the coefficients between latent variables and measured variables

    Conception de microARNs pour attenuer l'expression de genes

    Get PDF
    Les microARNs appartiennent Ă  la famille des petits ARNs non-codants et agissent comme inhibiteurs des ARN messagers et/ou de leurs produits protĂ©iques. Les mi- croARNs sont diffĂ©rents des petits ARNs interfĂ©rants (siARN) car ils attĂ©nuent l’ex- pression au lieu de l’éliminer. Dans les derniĂšres annĂ©es, de nombreux microARNs et leurs cibles ont Ă©tĂ© dĂ©couverts chez les mammifĂšres et les plantes. La bioinforma- tique joue un rĂŽle important dans ce domaine, et des programmes informatiques de dĂ©couvertes de cibles ont Ă©tĂ© mis Ă  la disposition de la communautĂ© scientifique. Les microARNs peuvent rĂ©guler chacun des centaines de gĂšnes, et les profils d’expression de ces derniers peuvent servir comme classificateurs de certains cancers. La modĂ©lisation des microARNs artificiels est donc justifiable, oĂč l’un pourrait cibler des oncogĂšnes surexprimĂ©s et promouvoir une prolifĂ©ration de cellules en santĂ©. Un outil pour crĂ©er des microARNs artificiels, nommĂ© MultiTar V1.0, a Ă©tĂ© crĂ©Ă© et est disponible comme application web. L’outil se base sur des propriĂ©tĂ©s structurelles et biochimiques des microARNs et utilise la recherche tabou, une mĂ©taheuristique. Il est dĂ©montrĂ© que des microARNs conçus in-silico peuvent avoir des effets lorsque testĂ©s in-vitro. Les sĂ©- quences 3’UTR des gĂšnes E2F1, E2F2 et E2F3 ont Ă©tĂ© soumises en entrĂ©e au programme MultiTar, et les microARNs prĂ©dits ont ensuite Ă©tĂ© testĂ©s avec des essais lucifĂ©rases, des western blots et des courbes de croissance cellulaire. Au moins un microARN artificiel est capable de rĂ©guler les trois gĂšnes par essais lucifĂ©rases, et chacun des microARNs a pu rĂ©guler l’expression de E2F1 et E2F2 dans les western blots. Les courbes de crois- sance dĂ©montrent que chacun des microARNs interfĂšre avec la croissance cellulaire. Ces rĂ©sultats ouvrent de nouvelles portes vers des possibilitĂ©s thĂ©rapeutiques.MicroRNAs belong to the family of small non-coding RNAs and act as down regula- tors of messenger RNAs and/or their protein products. microRNAs differ from siRNAs by downregulating instead of shutting down. In recent years, numerous microRNAs and their targets have been found in mammals and plants. Bioinformatics plays a big role in this field, as software has emerged to find new microRNA targets. Each individual microRNA can regulate hundreds of genes, and it has been shown that microRNA expression profiles can classify human cancers. The need for artificially created mi- croRNAs is then justified, as one could target overexpressed oncogenes and promote healthy cell proliferation. MultiTar V1.0, a tool for creating artificial microRNAs, has been implemented and is available as a web application. The tool relies on structural and biological properties of microRNAs and uses a Tabusearch metaheuristic. A typical biological problem is presented and it is shown that an in-silico microRNA has in-vitro effects. The 3’UTR sequences of E2F1, E2F2 and E2F3 were given as input to the tool, and predicted microRNAs were then tested using luciferase essays, western blots and growth curves. At least one microRNA is able to regulate the three genes with luciferase essays and all of the created microRNAs were able to regulate the expres- sion of E2F1 and E2F2 with western blots. Growth curves were also studied in order to investigate overall biological effects, and reduction in growth was observed for all solutions. Results obtained with the predicted microRNAs and the target genes open a new door into therapeutic possibilities

    Conception et mise au point d'un module de connexion réseau modulaire, bidirectionnel en courant et isolé

    Get PDF
    La nĂ©cessitĂ© de rĂ©duire le temps de dĂ©veloppement des convertisseurs de puissance et d’augmenter l’efficacitĂ© des Ă©quipements de test ne cesse de croitre. Le dĂ©veloppement continuel de technologie utilisant l’électricitĂ©, tel l’électrification de l’automobile contribue Ă  accĂ©lĂ©rer cette tendance. De plus, le prix de l’énergie Ă©tant sans cesse Ă  la hausse, l’intĂ©rĂȘt d’introduire des Ă©quipements de tests rĂ©gĂ©nĂ©ratifs pour la validation des convertisseurs de puissance gagne en importance. Les efforts dĂ©ployĂ©s dans le projet de ce mĂ©moire font suite au dĂ©veloppement des Ă©mulateurs de charges rĂ©gĂ©nĂ©ratifs. Ce type de charge intelligente nĂ©cessite une interface de connexion avec le rĂ©seau. Pour certaines applications, cette interface doit ĂȘtre bidirectionnelle et prĂ©senter une isolation galvanique. Par exemple, les amplificateurs de puissance utilisĂ©s pour Ă©muler le comportement d’une source de tension alternative, tels un rĂ©seau de distribution ou un moteur, peuvent fonctionner dans les quatre cadrans de couranttension. Il importe donc de fournir une connexion bidirectionnelle avec le rĂ©seau Ă  ce type de convertisseur. D’autre part, dans la foulĂ©e du dĂ©veloppement des convertisseurs multiniveau, la caractĂ©ristique isolĂ©e prend tout son sens, permettant de connecter plusieurs niveaux de tension de façon alĂ©atoire. Le dĂ©veloppement d’une unitĂ© de connexion rĂ©seau effectuant une conversion CA-CC est donc traitĂ© dans ce travail. La conception et l’optimisation d’un convertisseur CC-CC et d’un convertisseur CA-CC sont traitĂ©es. La modĂ©lisation, la simulation, la conception et les tests expĂ©rimentaux sur un prototype de 5kW sont effectuĂ©s. La stabilitĂ© de l’interconnexion entre les deux convertisseurs est Ă©galement analysĂ©e et testĂ©e en pratique

    An adaptive multi-agent system for task reallocation in a MapReduce job

    Get PDF
    International audienceWe study the problem of task reallocation for load-balancing of MapReduce jobs in applications that process large datasets. In this context, we propose a novel strategy based on cooperative agents used to optimise the task scheduling in a single MapReduce job. The novelty of our strategy lies in the ability of agents to identify opportunities within a current unbalanced allocation, which in turn trigger concurrent and one-to-many negotiations amongst agents to locally reallocate some of the tasks within a job. Our contribution is that tasks are reallocated according to the proximity of the resources and they are performed in accordance to the capabilities of the nodes in which agents are situated. To evaluate the adaptivity and responsiveness of our approach, we implement a prototype test-bed and conduct a vast panel of experiments in a heterogeneous environment and by exploring varying hardware configurations. This extensive experimentation reveals that our strategy significantly improves the overall runtime over the classical Hadoop data processing

    Fluorescent labeling in semi-solid medium for selection of mammalian cells secreting high-levels of recombinant proteins

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Despite the powerful impact in recent years of gene expression markers like the green fluorescent protein (GFP) to link the expression of recombinant protein for selection of high producers, there is a strong incentive to develop rapid and efficient methods for isolating mammalian cell clones secreting high levels of marker-free recombinant proteins. Recently, a method combining cell colony growth in methylcellulose-based medium with detection by a fluorescently labeled secondary antibody or antigen has shown promise for the selection of Chinese Hamster Ovary (CHO) cell lines secreting recombinant antibodies. Here we report an extension of this method referred to as fluorescent labeling in semi-solid medium (FLSSM) to detect recombinant proteins significantly smaller than antibodies, such as IGF-E5, a 25 kDa insulin-like growth factor derivative.</p> <p>Results</p> <p>CHO cell clones, expressing 300 ÎŒg/ml IGF-E5 in batch culture, were isolated more easily and quickly compared to the classic limiting dilution method. The intensity of the detected fluorescent signal was found to be proportional to the amount of IGF-E5 secreted, thus allowing the highest producers in the population to be identified and picked. CHO clones producing up to 9.5 ÎŒg/ml of Tissue-Plasminogen Activator (tPA, 67 kDa) were also generated using FLSSM. In addition, IGF-E5 high-producers were isolated from 293SF transfectants, showing that cell selection in semi-solid medium is not limited to CHO and lymphoid cells. The best positive clones were collected with a micromanipulator as well as with an automated colony picker, thus demonstrating the method's high throughput potential.</p> <p>Conclusion</p> <p>FLSSM allows rapid visualization of the high secretors from transfected pools prior to picking, thus eliminating the tedious task of screening a high number of cell isolates. Because of its rapidity and its simplicity, FLSSM is a versatile method for the screening of high producers for research and industry.</p

    Allocation équitable de tùches pour l'analyse de données massives

    Get PDF
    L'URL de l'ouvrage est la suivante:http://www.cepadues.com/livres/jfsma-2016-systemes-multi-agents-simulations-9782364935594.htmlInternational audienceMany companies are using MapReduce applications to process very large amounts of data. Static optimization of such applications is complex because they are based on user-defined operations, called map and reduce, which prevents some algebraic optimization. In order to optimize the task allocation, several systems collect data from previous runs and predict the performance doing job profiling. However they are not effective during the learning phase, or when a new type of job or data set appears. In this paper, we present an adaptive multiagent system for large data sets analysis with MapReduce. We do not preprocess data and we adopt a dynamic approach, where the reducer agents interact during the job. In order to decrease the workload of the most loaded reducer - and so the execution time - we propose a task re-allocation based on negotiation.De nombreuses entreprises utilisent l'application MapReduce pour le traitement de donnĂ©es massives. L'optimisation statique de telles applications est complexe car elles reposent sur des opĂ©rations dĂ©finies par l'utilisateur, appelĂ©es map et reduce, ce qui empĂȘche une optimisation algĂ©brique. Afin d'optimiser l'allocation des tĂąches, plusieurs systĂšmes collectent des donnĂ©es Ă  partir des exĂ©cutions prĂ©cĂ©dentes et prĂ©disent les performances en faisant une analyse de la tĂąche. Cependant, ces systĂšmes ne sont pas efficaces durant la phase d'apprentissage ou lorsqu'un nouveau type de tĂąches ou de donnĂ©es apparait. Dans ce papier, nous prĂ©sentons un systĂšme multi-agents adaptatif pour l'analyse de donnĂ©es massives avec MapReduce. Nous ne prĂ©-traitons pas les donnĂ©es et adoptons une approche dynamique oĂč les agents reducers interagissent durant l'exĂ©cution. Nous proposons une rĂ©-allocation des tĂąches basĂ©e sur la nĂ©gociation pour parvenir Ă  faire dĂ©croitre la charge de travail du plus chargĂ© des agents reducers et ainsi rĂ©duire le temps d'exĂ©cution

    A Location-Aware Strategy for Agents Negotiating Load-balancing

    Get PDF
    International audienceWe study a novel location-aware strategy for distributed systems where cooperating agents perform the load-balancing. The strategy allows agents to identify opportunities within a current unbalanced allocation , which in turn triggers concurrent and one-to-many negotiations amongst agents to locally reallocate some tasks. The tasks are reallocated according to the proximity of the resources and they are performed in accordance with the capabilities of the nodes in which agents are situated. This dynamic and ongoing negotiation process takes place concurrently with the task execution and so the task allocation process is adaptive to disruptions (task consumption, slowing down nodes). We evaluate the strategy in a multi-agent deployment of the MapReduce design pattern for processing large datasets. Empirical results demonstrate that our strategy significantly improves the overall runtime of the data processing

    Stratégie situationnelle pour l'équilibrage de charge

    Get PDF
    National audienceWe study a novel location-aware strategy for distributed systems where cooperating agents perform the load-balancing. The strategy allows agents to identify opportunities within a current unbalanced allocation, which in turn triggers concurrent and one-to-many negotiations amongst agents to locally reallocate some tasks. The tasks are reallocated according to the proximity of the resources and they are performed in accordance with the capabilities of the nodes in which agents are situated. This dynamic and ongoing negotiation process takes place concurrently with the task execution and so the task allocation process is adaptive to disruptions (task consumption, slowing down nodes). We evaluate the strategy in a multi-agent deployment of the MapReduce design pattern for processing large datasets. Empirical results demonstrate that our strategy significantly improves the overall runtime of the data processing.Nous Ă©tudions une stratĂ©gie qui tient compte de la localitĂ© des ressources pour Ă©quilibrer les charges dans un systĂšme distribuĂ©. Cette stratĂ©gie permet aux agents coopĂ©ratifs d'identifier une allocation non Ă©quilibrĂ©e, voire de dĂ©clencher des enchĂšres concurrentes pour rĂ©allouer localement certaines des tĂąches. Les tĂąches sont rĂ©allouĂ©es en tenant compte de l'accessibilitĂ© des ressources pour les agents ; elles sont exĂ©cutĂ©es conformĂ©ment aux capacitĂ©s des noeuds de calcul sur lesquels se trouvent les agents. Ce processus de nĂ©gociation dynamique et continu est concurrent Ă  l'exĂ©cution des tĂąches, ce qui permet d'adapter l'allocation des tĂąches aux perturbations (exĂ©cution de tĂąche, chute de performance d'un nƓud). Nous Ă©valuons cette stratĂ©gie dans le cadre du dĂ©ploiement multi-agents de MapReduce. Ce patron de conception permet le traitement distribuĂ© de donnĂ©es massives. Les rĂ©sultats empiriques dĂ©montrent que notre stratĂ©gie amĂ©liore significativement le temps d'exĂ©cution du traitement d'un jeu de donnĂ©es
    • 

    corecore